A Human-Robot Collaborative Reinforcement Learning Algorithm

نویسندگان

  • Uri Kartoun
  • Helman Stern
  • Yael Edan
چکیده

This paper presents a new reinforcement learning algorithm that enables collaborative learning between a robot and a human. The algorithm which is based on the Q(λ) approach expedites the learning process by taking advantage of human intelligence and expertise. The algorithm denoted as CQ(λ) provides the robot with self awareness to adaptively switch its collaboration level from autonomous (self performing, the robot decides which actions to take, according to its learning function) to semi-autonomous (a human advisor guides the robot and the robot combines this knowledge into its learning function). This awareness is represented by a self test of its learning performance. The approach of variable autonomy is demonstrated and evaluated using a fixed-arm robot for finding the optimal shaking policy to empty the contents of a plastic bag. A comparison between the CQ(λ) and This work was partially supported by the Paul Ivanier Center for Robotics Research and Production Management, and by the Rabbi W. Gunther Plaut Chair in Manufacturing Engineering, Ben-Gurion University of the Negev. We would like to thank Dr. Amir Shapiro from the Department of Mechanical Engineering for his suggestions, and comments. We would also like to thank Dr. Sigal Berman and Mr. Amit Gil from the Department of Industrial Engineering and Management for their critical reading of the text. U. Kartoun (B) Microsoft Medical Medial Lab (M3L), Microsoft, Washington DC, USA e-mail: [email protected] H. Stern · Y. Edan Department of Industrial Engineering and Management, Ben-Gurion University of the Negev, Beer-Sheva 84105, Israel H. Stern e-mail: [email protected] Y. Edan e-mail: [email protected] 218 J Intell Robot Syst (2010) 60:217–239 the traditional Q(λ)-reinforcement learning algorithm, resulted in faster convergence for the CQ(λ) collaborative reinforcement learning algorithm.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Dynamic Obstacle Avoidance by Distributed Algorithm based on Reinforcement Learning (RESEARCH NOTE)

In this paper we focus on the application of reinforcement learning to obstacle avoidance in dynamic Environments in wireless sensor networks. A distributed algorithm based on reinforcement learning is developed for sensor networks to guide mobile robot through the dynamic obstacles. The sensor network models the danger of the area under coverage as obstacles, and has the property of adoption o...

متن کامل

A Cognitive Robot Collaborative Reinforcement Learning Algorithm

A cognitive collaborative reinforcement learning algorithm (CCRL) that incorporates an advisor into the learning process is developed to improve supervised learning. An autonomous learner is enabled with a self awareness cognitive skill to decide when to solicit instructions from the advisor. The learner can also assess the value of advice, and accept or reject it. The method is evaluated for r...

متن کامل

Collaborative Q(λ) Reinforcement Learning Algorithm - a Promising Robot Learning Framework

This paper presents the design and implementation of a new reinforcement learning (RL) based algorithm. The proposed algorithm, ) ( CQ (collaborative ) ( Q ) allows several learning agents to acquire knowledge from each other. Acquiring knowledge learnt by an agent via collaboration with another agent enables acceleration of the entire learning system; therefore, learning can be utilized more...

متن کامل

Improving adaptation of ubiquitous recommander systems by using reinforcement learning and collaborative filtering

The wide development of mobile applications provides a considerable amount of data of all types (images, texts, sounds, videos, etc.). Thus, two main issues have to be considered: assist users in finding information and reduce search and navigation time. In this sense, context-based recommender systems (CBRS) propose the user the adequate information depending on her/his situation. Our work con...

متن کامل

REFERRING EXPRESSION GENERATION TOWARDS MEDIATING SHARED PERCEPTUAL BASIS IN SITUATED DIALOGUE By

REFERRING EXPRESSION GENERATION TOWARDS MEDIATING SHARED PERCEPTUAL BASIS IN SITUATED DIALOGUE By Rui Fang Situated human-robot dialogue has received increasing attention in recent years. In situated dialogue, robots/artificial agents and their human partners are co-present in a shared physical world. Robots need to automatically perceive and make inference of the shared environment. Due to its...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Journal of Intelligent and Robotic Systems

دوره 60  شماره 

صفحات  -

تاریخ انتشار 2010